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(2) INFORMATION FOR SEQ ID NO:l: 
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(i) 

SEQUENCE CHARACTERISTICS: 

Q 1 

7 X 
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(A) 

LENGTH: 63 amino acids 

7 J 



7 ft 

(B) 

TYPE: amino acid 

73 



7 O 

(D) TOPOLOGY: linear 

97 



98 

(ii) MOLECULE TYPE: peptide 

99 



100 



101 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

102 



103 

Ala Ser Thr Leu Gly Ser Ser Thr Pro Lys Val 

104 

1 

5 10 

105 



106 

Pro Phe 

Gin Pro Pro Arg Glu Val His Val Gin 


15 
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107 20 25 30 

108 

109 Pro Pro Gin Lys He Glu He Phe Lys Ser He Glu Gly Xaa Ala Glu 

110 35 40 45 
111 

112 Gin Asn He Leu Val Xaa Leu Lys Pro Val Glu Lys Cys Trp Gin 

113 50 55 60 
114 

115 

116 (2) INFORMATION FOR SEQ ID NO: 2: 
117 

118 (i) SEQUENCE CHARACTERISTICS: 

119 

120 (A) LENGTH: 56 amino acids 

121 

122 (B) TYPE: amino acid 

123 

124 (D) TOPOLOGY: linear 

125 

126 (ii) MOLECULE TYPE: peptide 

127 

128 

129 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
130 

131 Asp Phe Leu Pro Asp Pro Ala Xaa Glu Gly Phe Asp Glu Gin Val Lys 

132 15 10 15 
133 

134 Glu Leu Arg Ala Arg Ala Lys Glu He Pro Asp Asp Tyr Phe Val Val 

135 20 25 30 
136 

137 Leu Val Gly Asp Met He Thr Glu Glu Ala Leu Pro Thr Tyr Gin Thr 

138 35 40 45 
139 

140 Met Leu Asn Thr Leu Asp Gly Val 

141 50 55 
142 

143 

144 (2) INFORMATION FOR SEQ ID NO: 3: 
145 

146 (i) SEQUENCE CHARACTERISTICS: 

147 

148 (A) LENGTH: 14 amino acids 

149 

150 (B) TYPE: amino acid 

151 

152 (D) TOPOLOGY: linear 

153 

154 (ii) MOLECULE TYPE: peptide 

155 

156 

157 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
158 

159 Asp Glu Thr Gly Ala Ser Leu Thr Pro Trp Ala Val Trp Thr 
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160 15 10 

161 

162 

163 (2) INFORMATION FOR SEQ ID NO: 4: 
164 

165 (i) SEQUENCE CHARACTERISTICS: 

166 

167 (A) LENGTH: 13 amino acids 

168 

169 (B) TYPE: amino acid 

170 

171 (D) TOPOLOGY: linear 

172 

173 (ii) MOLECULE TYPE: peptide 

174 

175 

176 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
177 

178 Asp Leu Leu His Thr Tyr Leu Tyr Leu Ser Gly Arg Val 

179 15 10 
180 

181 

182 (2) INFORMATION FOR SEQ ID NO: 5: 

183 

184 (i) SEQUENCE CHARACTERISTICS: 

185 

186 (A) LENGTH: 13 amino acids 

187 

188 (B) TYPE: amino acid 

189 

190 (D) TOPOLOGY: linear 

191 

192 (ii) MOLECULE TYPE: peptide 

193 

194 

195 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
196 

197 Asp Met Arg Gin lie Gin Lys Thr lie Gin Tyr Leu lie 

198 1 5 10 
199 

200 

201 (2) INFORMATION FOR SEQ ID NO: 6: 
202 

203 (i) SEQUENCE CHARACTERISTICS: 

204 

205 (A) LENGTH: 17 amino acids 

206 

207 (B) TYPE: amino acid 

208 

209 (D) TOPOLOGY: linear 

210 

211 (ii) MOLECULE TYPE: peptide 
212 
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213 
214 

215 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
216 

217 Thr Olu Asn Ser Pro Tyr Leu Gly Phe He Tyr Thr Ser Phe Gin Glu 

218 15 10 15 
219 

220 Arg 

221 

222 

223 (2) INFORMATION FOR SEQ ID NO: 7: 
224 

225 (i) SEQUENCE CHARACTERISTICS: 

226 

227 (A) LENGTH: 57 amino acids 

228 

229 (B) TYPE: amino acid 

230 

231 (D) TOPOLOGY: linear 

232 

233 (ii) MOLECULE TYPE: peptide 

234 

235 

236 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
237 

238 Asp Val Xaa Leu Ala Gin He Xaa Gly Thr He Ala Ser Asp Glu Lys 

239 1 5 10 15 
240 

241 Arg His Glu Thr Ala Tyr Thr Lys He Val Glu Lys Leu Phe Glu He 

242 20 25 30 
243 

244 Asp Pro Asp Gly Thr Val Leu Ala Phe Ala Asp Met Met Arg Lys Lys 

245 35 40 45 
246 

247 He Xaa Met Pro Ala His Leu Met Tyr 

248 50 55 
249 

250 

251 (2) INFORMATION FOR SEQ ID NO: 8: 
252 

253 ( i ) SEQUENCE CHARACTER I STI CS : 

254 

255 (A) LENGTH: 4 amino acids 

256 

257 (B) TYPE: amino acid 

258 

259 (D) TOPOLOGY: linear 

260 

261 (ii) MOLECULE TYPE: peptide 

262 

263 

264 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
265 
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266 Asp Asn Leu Phe 

267 1 
268 

269 (2) INFORMATION FOR SEQ ID NO: 9: 
270 

271 (i) SEQUENCE CHARACTERISTICS: 

272 

273 (A) LENGTH: 16 amino acids 

274 

275 (B) TYPE: amino acid 

276 

277 (D) TOPOLOGY: linear 

278 

279 (ii) MOLECULE TYPE: peptide 

280 

281 

282 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
283 

284 Xaa Xaa Phe Xaa Ala Val Xaa Gin Arg Leu Xaa Val Tyr Thr Ala Lys 

285 1 5 10 15 
286 

287 

288 (2) INFORMATION FOR SEQ ID NO: 10: 
289 

290 (i) SEQUENCE CHARACTERISTICS: 

291 

292 (A) LENGTH: 14 amino acids 

293 

294 (B) TYPE: amino acid 

295 

296 (D) TOPOLOGY: linear 

297 

298 (ii) MOLECULE TYPE: peptide 

299 

300 

301 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
302 

303 Asp Tyr Ala Asp lie Leu Glu Phe Leu Val Gly Arg Trp Lys 

304 1 5 10 
305 

306 

307 (2) INFORMATION FOR SEQ ID NO: 11: 
308 

309 (i) SEQUENCE CHARACTERISTICS: 

310 

311 (A) LENGTH: 54 amino acids 

312 

313 (B) TYPE: amino acid 

314 

315 (D) TOPOLOGY: linear 

316 

317 (ii) MOLECULE TYPE: peptide 
318 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Val Ala Asp Leu Thr Oly Leu Ser Gly Glu Gly Arg Lys Ala Xaa Asp 
15 10 15 

Tyr Val Cys Gly Leu Pro Pro Arg lie Arg Arg Leu Glu Glu Arg Ala 
20 25 30 

v 

Gin Gly Arg Ala Lys Glu Gly Pro Val Val Pro Phe Ser Trp lie Phe 
35 40 45 

Asp Arg Gin Val Lys Leu 
50 

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1533 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GCTCACTTGT GTGGTGGAGG AGAAAAACAG AACTCACAAA AAGCTTTGCG ACTGCCAAGA 60 

ACAACAACAA CAACAAGATC AAGAAGAAGA AGAAGAAGAT CAAAAATGGC TCTTCGAATC 120 

ACTCCAGTGA CCTTGCAATC GGAGAGATAT CGTTCGTTTT CGTTTCCTAA GAAGGCTAAT 180 

CTCAGATCTC CCAAATTCGC CATGGCCTCC ACCCTCGGAT CATCCACACC GAAGGTTGAC 240 

AATGCCAAGA AGCCTTTTCA ACCTCCACGA GAGGTTCATG TTCAGGTGAC GCACTCCATG 300 

CCACCACAGA AGATAGAGAT TTTCAAATCC ATCGAGGGTT GGGCTGAGCA GAACATATTG 360 

GTTCACCTAA AGCCAGTGGA GAAATGTTGG CAAGCACAGG ATTTCTTGCC GGACCCTGCA 420 

TCTGAAGGAT TTGATGAACA AGTCAAGGAA CTAAGGGCAA GAGCAAAGGA GATTCCTGAT 480 

GATTACTTTG TTGTTTTGGT TGGAGATATG ATTACAGAGG AAGCCCTACC TACTTACCAA 540 
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372 ACAATGCTTA ATACCCTA6A TGGTGTACGT GATGAGACTG GGGCTAGCCT TACGCCTTGG 600 
373 

374 GCTGTCTGGA CTAGGGCTTG GACAGCTGAA GAGAACAGGC ATGGCGATCT TCTCCACACC 660 
375 

376 TATCTCTACC TTTCTGGGCG GGTAGACATG AGGCAGATAC AGAAGACAAT TCAGTATCTC 720 
377 

378 ATTGGGTCAG GAATGGATCC TCGTACCGAA AACAGCCCCT ACCTTGGGTT CATCTACACA 780 
379 

380 TCGTTTCAAG AGCGTGCCAC ATTTGTTTCT CACGGAAACA CCGCCAGGCA TGCAAAGGAT 840 
381 

382 CATGGGGACG TGAAACTGGC GCAAATTTGT GGTACAATCG CGTCTGACGA AAAGCGTCAC 900 
383 

384 GAGACCGCTT ATACAAAGAT AGTCGAAAAG CTATTCGAGA TCGATCCTGA TGGCACCGTT 960 
385 

386 CTTGCTTTTG CCGACATGAT GAGGAAAAAG ATCTCGATGC CCGCACACTT GATGTACGAT 1020 
387 

388 GGGCGTGATG ACAACCTCTT CGAACATTTC TCGGCGGTTG CCCAAAGACT CGGCGTCTAC 1080 
389 

390 ACCGCCAAAG ACTACGCCGA CATACTGGAA TTTCTGGTCG GGCGGTGGAA AGTGGCGGAT 1140 
391 

392 TTGACCGGCC TATCTGGTGA AGGGCGTAAA GCGCAAGATT ATGTTTGCGG GTTGCCACCA 1200 
393 

394 AGAATCAGAA GGCTGGAGGA GAGAGCTCAA GGGCGAGCAA AGGAAGGACC TGTTGTTCCA 1260 
395 

396 TTCAGCTGGA TTTTCGATAG ACAGGTGAAG CTGTGAAGAA AAAAAAAACG AGCAGTGAGT 1320 
397 

398 TCGGTTTCTG TTGGCTTATT GGGIAGAGGT TAAAACCTAT TTTAGATGTC TGTTTCGTGT 1380 
399 

400 AATGTGGTTT TTTTTCTTCT AATCTTGAAT CTGGTATTGT GTCGTTGAGT TCGCGTGTGT 1440 
401 

402 GTAAACTTGT GTGGCTGTGG ACATATTATA GAACTCGTTA TGCCAATTTT GATGACGGTG 1500 
403 

404 GTTATCGTCT CCCCTGGTGT TTTTTTATTG TTT 1533 

405 

406 

407 (2) INFORMATION FOR SEQ ID NO: 13: 
408 

409 (i) SEQUENCE CHARACTERISTICS: 

410 

411 (A) LENGTH: 396 amino acids 

412 

413 (B) TYPE: amino acid 

414 

415 (D) TOPOLOGY: linear 

416 

417 (ii) MOLECULE TYPE: protein 

418 

419 

420 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
421 

422 Met Ala Leu Arg lie Thr Pro Val Thr Leu Gin Ser Glu Arg Tyr Arg 

423 -30 -25 -20 
424 
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425 Ser Phe Ser Phe Pro Lys Lys Ala Asn Leu Arg Ser Pro Lys Phe Ala 

426 -15 -10 -5 
427 

428 

429 Met Ala Ser Thr Leu Gly Ser Ser Thr Pro Lys Val Asp Asn Ala Lys 

430 1 5 10 15 
431 

432 Lys Pro Phe Gin Pro Pro Arg Glu Val His Val Gin Val Thr His Ser 

433 20 25 30 
434 

435 Met Pro Pro Gin Lys lie Glu lie Phe Lys Ser lie Glu Gly Trp Ala 

436 35 40 45 
437 

438 Glu Gin Asn lie Leu Val His Leu Lys Pro Val Glu Lys Cys Trp Gin 

439 50 55 60 
440 

441 Ala Gin Asp Phe Leu Pro Asp Pro Ala Ser Glu Gly Phe Asp Glu Gin 

442 65 70 75 
443 

444 Val Lys Glu Leu Arg Ala Arg Ala Lys Glu He Pro Asp Asp Tyr Phe 

445 80 85 90 95 
446 

447 Val Val Leu Val Gly Asp Met He Thr Glu Glu Ala Leu Pro Thr Tyr 

448 100 105 110 
449 

450 Gin Thr Met Leu Asn Thr Leu Asp Gly Val Arg Asp Glu Thr Gly Ala 

451 115 120 125 
452 

453 Ser Leu Thr Pro Trp Ala Val Trp Thr Arg Ala Trp Thr Ala Glu Glu 

454 130 135 140 
455 

456 Asn Arg His Gly Asp Leu Leu His Thr Tyr Leu Tyr Leu Ser Gly Arg 

457 145 150 155 
458 

459 Val Asp Met Arg Gin He Gin Lys Thr He Gin Tyr Leu He Gly Ser 

460 160 165 170 175 
461 

462 Gly Met Asp Pro Arg Thr Glu Asn Ser Pro Tyr Leu Gly Phe He Tyr 

463 180 185 190 
464 

465 Thr Ser Phe Gin Glu Arg Ala Thr Phe Val Ser His Gly Asn Thr Ala 

466 195 200 205 
467 

468 Arg His Ala Lys Asp His Gly Asp Val Lys Leu Ala Gin He Cys Gly 

469 210 215 220 
470 

471 Thr He Ala Ser Asp Glu Lys Arg His Glu Thr Ala Tyr Thr Lys He 

472 225 230 235 
473 

474 Val Glu Lys Leu Phe Glu He Asp Pro Asp Gly Thr Val Leu Ala Phe 

475 240 245 250 255 
476 

477 Ala Asp Met Met Arg Lys Lys He Ser Met Pro Ala His Leu Met Tyr 
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478 260 265 270 

479 

480 Asp Gly Arg Asp Asp Asa Leu Phe Glu His Pbe Ser Ala Val Ala Gin 

481 275 280 285 
482 

483 Arg Leu Gly Val Tyr Thr Ala Lys Asp Tyr Ala Asp lie Leu Glu Phe 

484 290 295 300 
485 

486 Leu Val Gly Arg Trp Lys Val Ala Asp Leu Thr Gly Leu Ser Gly Glu 

487 305 310 315 
488 

489 Gly Arg Lys Ala Gin Asp Tyr Val Cys Gly Leu Pro Pro Arg lie Arg 

490 320 325 330 335 
491 

492 Arg Leu Glu Glu Arg Ala Gin Gly Arg Ala Lys Glu Gly Pro Val Val 

493 340 345 350 
494 

495 Pro Phe Ser Trp lie Phe Asp Arg Gin Val Lys Leu 

496 355 360 
497 

498 

499 (2) INFORMATION FOR SEQ ID NO: 14: 
500 

501 (i) SEQUENCE CHARACTERISTICS: 

502 

503 (A) LENGTH: 225 base pairs 

504 

505 (B) TYPE: nucleic acid 

506 

507 (C) STRANDEDNESS: double 

508 

509 (D) TOPOLOGY: linear 

510 

511 (ii) MOLECULE TYPE: cDNA to mRNA 
512 

513 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
514 

515 AAAAGAAAAA GGTAAGAAAA AAAACA ATG GCT CTC AAG CTC AAT CCT TTC CTT 53 

516 MET Ala Leu Lys Leu Asn Pro Phe Leu 

517 1 5 
518 

519 TCT CAA ACC CAA AAG TTA CCT TCT TTC GCT CTT CCA CCA ATG GCC AGT 101 

520 Ser Gin Thr Gin Lys Leu Pro Ser Phe Ala Leu Pro Pro MET Ala Ser 

521 10 15 20 25 
522 

523 ACC AGA TCT CCT AAG TTC TAC ATG GCC TCT ACC CTC AAG TCT GGT TCT 149 

524 Thr Arg Ser Pro Lys Phe Tyr MET Ala Ser Thr Leu Lys Ser Gly Ser 

525 30 35 40 
526 

527 AAG GAA GTT GAG AAT CTC AAG AAG CCT TTC ATG CCT CCT CGG GAG GTA 197 

528 Lys Glu Val Glu Asn Leu Lys Lys Pro Phe MET Pro Pro Arg Glu Val 

529 45 50 55 
530 
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531 
532 
533 
534 
535 
536 
537 
538 
539 
540 
541 
542 
543 
544 
545 
546 
547 
548 
549 
550 


CAT GTT CAG GTT ACC CAT TCT ATT GCC A 
His Val Gin Val Thr His Ser lie Ala 
60 65 


(ii) MOLECULE TYPE: 


(2) INFORMATION FOR SEQ ID NO: 15: 


(B) TYPE: 


(A) LENGTH: 


(i) SEQUENCE CHARACTERISTICS: 


(D) TOPOLOGY: 


(C) STRANDEDNESS : 


nucleic acid 


1668 base pairs 


linear 


CDNA to mRNA 


double 


551 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
552 

553 AAAAGAAAAA GGTAAGAAAA AAAACAATGG CTCTCAAGCT CAATCCTTTC CTTTCTCAAA 
554 

555 CCCAAAAGTT ACCTTCTTTC GCTCTTCCAC CAATGGCCAG TACCAGATCT CCTAAGTTCT 
556 

557 ACATGGCCTC TACCCTCAAG TCTGGTTCTA AGGAAGTTGA GAATCTCAAG AAGCCTTTCA 
558 

559 TGCCTCCTCG GGAGGTACAT GTTCAGGTTA CCCATTCTAT GCCACCCCAA AAGATTGAGA 
560 

561 TCTTTAAATC CCTAGACAAT TGGGCTGAGG AGAACATTCT GGTTCATCTG AAGCCAGTTG 
562 

563 AGAAATGTTG GCAACCGCAG GATTTTTTGC CAGATCCCGC CTCTGATGGA TTTGATGAGC 
564 

565 AAGTCAGGGA ACTCAGGGAG AGAGCAAAGG AGATTCCTGA TGATTATTTT GTTGTTTTGG 
566 

567 TTGGAGACAT GATAACGGAA GAAGCCCTTC CCACTTATCA AACAATGCTG AATACCTTGG 
568 

569 ATGGAGTTCG GGATGAAACA GGTGCAAGTC CTACTTCTTG GGCAATTTGG ACAAGGGCAT 
570 

571 GGACTGCGGA AGAGAATAGA CATGGTGACC TCCTCAATAA GTATCTCTAC CTATCTGGAC 
572 

573 GAGTGGACAT GAGGCAAATT GAGAAGACAA TTCAATATTT GATTGGTTCA GGAATGGATC 
574 

575 CACGGACAGA AAACAGTCCA TACCTTGGGT TCATCTATAC ATCATTCCAG GAAAGGGCAA 
576 

577 CCTTCATTTC TCATGGGAAC ACTGCCCGAC AAGCCAAAGA GCATGGAGAC ATAAAGTTGG 
578 

579 CTCAAATATG TGGTACAATT GCTGCAGATG AGAAGCGCCA TGAGACAGCC TACACAAAGA 
580 

581 TAGTGGAAAA ACTCTTTGAG ATTGATCCTG ATGGAACTGT TTTGGCTTTT GCTGATATGA 
582 

583 TGAGAAAGAA AATTTCTATG CCTGCACACT TGATGTATGA TGGCCGAGAT GATAATCTTT 


60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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584 

585 TTGACCACTT TTCAGCTGTT GCGCAGCGTC TTGGAGTCTA CACAGCAAAG GATTATGCAG 1020 
586 

587 ATATATTGGA GTTCTTGGTG GGCAGATGGA AGGTGGATAA ACTAACGGGC CTTTCAGCTG 1080 
588 

589 AGGGACAAAA GGCTCAGGAC TATGTTTGTC GGTTACCTCC AAGAATTAGA AGGCTGGAAG 1140 
590 

591 AGAGAGCTCA AGGAAGGGCA AAGGAAGCAC CCACCATGCC TTTCAGCTGG ATTTTCGATA 1200 
592 

593 GGCAAGTGAA GCTGTAGGTG GCTAAAGTGC AGGACGAAAC CGAAATGGTT AGTTTCACTC 1260 
594 

595 TTTTTCATGC CCATCCCTGC AGAATCAGAA GTAGAGGTAG AATTTTGTAG TTGCTTTTTT 1320 
596 

597 ATTACAAGTC CAGTTTAGTT TAAGGXCTGT GGAAGGGAGT TAGTTGAGGA GTGAATTTAG 1380 
598 

599 TAAGTTGTAG ATACAGTTGT TICTTGTGTT GTCATGAGTA TGCTGATAGA GAGCAGCTGT 1440 
600 

601 AGTTTTGTTG TTGTGTTCTT TTATATGGTC TCTTGTATGA GTTTCTTTTC TTTCCTTTTC 1500 
602 

603 TTCTTTCCTT TCCTCTCTCT CTCTCTCTCT CTCTCTCTTT TTCTCTTATC CCAAGTGTCT 1560 
604 

605 CAAGTATAAT AAGCAAACGA TCCATGTGGC AATTTTGATG ATGGTGATCA GTCICACAAC 1620 
606 

607 TTGATCTTTT GTCTTCTATT GGAAACACAG CCTGCTTGTT TGAAAAAA 1668 
608 
609 

610 (2) INFORMATION FOR SEQ ID NO: 16: 
611 

612 (i) SEQUENCE CHARACTERISTICS: 

613 

614 (A) LENGTH: 396 amino acids 

615 

616 (B) TYPE: amino acid 

617 

618 (D) TOPOLOGY: linear 

619 

620 (ii) MOLECULE TYPE: protein 
621 
622 

623 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
624 

625 MET Ala Leu Lys Leu Asn Pro Phe Leu Ser Gin Thr 

626 1 5 10 
627 

628 Ser Phe Ala Leu Pro Pro MET Ala Ser Thr Arg Ser 

629 20 25 
630 

631 MET Ala Ser Thr Leu Lys Ser Gly Ser Lys Glu Val 

632 35 40 
633 

634 Lys Pro Phe MET Pro Pro Arg Glu Val His Val Gin 

635 50 55 60 
636 


Gin Lys Leu Pro 
15 

Pro Lys Phe Tyr 
30 

Glu Asn Leu Lys 
45 

Val Thr His Ser 
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637 MET Pro Pro Gin Lys lie Glu lie Phe Lys Ser Leu Asp Asn Trp Ala 

638 65 70 75 80 
639 

640 Olu Glu Asn lie Leu Val His Leu Lys Pro Val Glu Lys Cys Trp Gin 

641 85 90 95 
642 

643 

644 Pro Gin Asp Phe Leu Pro Asp Pro Ala Ser Asp Gly Phe Asp Glu Gin 

645 100 105 110 
646 

647 Val Arg Glu Leu Arg Glu Arg Ala Lys Glu lie Pro Asp Asp Tyr Phe 

648 115 120 125 
649 

650 Val Val Leu Val Gly Asp NET lie Thr Glu Glu Ala Leu Pro Thr Tyr 

651 130 135 140 
652 

653 Gin Thr MET Leu Asn Thr Leu Asp Gly Val Arg Asp Glu Thr Gly Ala 

654 145 150 155 160 
655 

656 Ser Pro Thr Ser Trp Ala lie Trp Thr Arg Ala Trp Thr Ala Glu Glu 

657 165 170 175 
658 

659 Asn Arg His Gly Asp Leu Leu Asn Lys Tyr Leu Tyr Leu Ser Gly Arg 

660 180 185 190 
661 

662 Val Asp MET Arg Gin He Glu Lys Thr He Gin Tyr Leu He Gly Ser 

663 195 200 205 
664 

665 Gly MET Asp Pro Arg Thr Glu Asn Ser Pro Tyr Leu Gly Phe He Tyr 

666 210 215 220 
667 

668 Thr Ser Phe Gin Glu Arg Ala Thr Phe lie Ser His Gly Asn Thr Ala 

669 225 230 235 240 
670 

671 Arg Gin Ala Lys Glu His Gly Asp lie Lys Leu Ala Gin lie Cys Gly 

672 245 250 255 
673 

674 Thr lie Ala Ala Asp Glu Lys Arg His Glu Thr Ala Tyr Thr Lys He 

675 260 265 270 
676 

677 Val Glu Lys Leu Phe Glu He Asp Pro Asp Gly Thr Val Leu Ala Phe 

678 275 280 285 
679 

680 Ala Asp MET MET Arg Lys Lys lie Ser MET Pro Ala His Leu MET Tyr 

681 290 295 300 
682 

683 Asp Gly Arg Asp Asp Asn Leu Phe Asp His Phe Ser Ala Val Ala Gin 

684 305 310 315 320 
685 

686 Arg Leu Gly Val Tyr Thr Ala Lys Asp Tyr Ala Asp lie Leu Glu Phe 

687 325 330 335 
688 

689 Leu Val Gly Arg Trp Lys Val Asp Lys Leu Thr Gly Leu Ser Ala Glu 
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690 340 345 350 

691 

692 Gly Gin Lys Ala Gin Asp Tyr Val Cys Arg Leu Pro Pro Arg lie Arg 

693 355 360 365 
694 

695 Arg Leu Glu Glu Arg Ala Gin Gly Arg Ala Lys Glu Ala Pro Thr MET 

696 370 375 380 
697 

698 Pro Phe Ser Trp lie Phe Asp Arg Gin Val Lys Leu 

699 385 390 395 
700 

701 

702 (2) INFORMATION FOR SEQ ID NO: 17: 

703 

704 (i) SEQUENCE CHARACTERISTICS: 

705 

706 (A) LENGTH: 117 base pairs 

707 

708 (B) TYPE: nucleic acid 

709 

710 (C) STRANDEDNESS: double 

711 

712 (D) TOPOLOGY: linear 

713 

714 (ii) MOLECULE TYPE: cDNA to mRNA 

715 

716 

717 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
718 

719 TGAGAGATAG TGTGAGAGCA TTAGCCTTAG AGAGAGAGAG AGAGAGCTTG TGTCTGAAAG 60 
720 

721 AATCCACAA ATG GCA TTG AAG CTT AAC CCT TTG GCA TCT CAG CCT TAC AAC 111 

722 MET Ala Leu Lys Leu Asn Pro Leu Ala Ser Gin Pro Tyr Asn 

723 1 5 10 
724 

725 TTC CCT 117 

726 Phe Pro 

727 15 
728 
729 

730 (2) INFORMATION FOR SEQ ID NO: 18: 
731 

732 (i) SEQUENCE CHARACTERISTICS: 

733 

734 (A) LENGTH: 176 base pairs 

735 

736 (B) TYPE: nucleic acid 

737 

738 (C) STRANDEDNESS: double 

739 

740 (D) TOPOLOGY: linear 

741 

742 (ii) MOLECULE TYPE: cDNA to mRNA 
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743 
744 

745 (xi) SEQUENCE DESCRIPTION: 
746 

747 ACT TCA TGG GCT ATT T6G ACA 

748 Thr Ser Trp Ala lie Trp Thr 

749 1 5 
750 
751 

752 CAC GGT GAT CTT CTC AAT AAG 

753 His Gly Asp Leu Leu Asn Lys 

754 20 
755 

756 ATG AGG CAG ATT GAA AAG ACC 

757 MET Arg Gin He Glu Lys Thr 

758 35 
759 

760 GAT CCT AGA ACA GAG AAC AAT 

761 Asp Pro Arg Thr Glu Asn Asn 

762 50 55 
763 
764 

765 (2) INFORMATION FOR SEQ ID NO: 19: 
766 

767 (i) SEQUENCE CHARACTERISTICS: 

768 

769 (A) LENGTH: 1495 base pairs 

770 

771 (B) TYPE: nucleic acid 

772 

773 (C) STRANDEDNESS: double 

774 

775 (D) TOPOLOGY: linear 

776 

777 (ii) MOLECULE TYPE: cDNA to mRNA 

778 

779 

780 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
781 

782 TGAGAGATAG TGTGAGAGCA TTAGCCTTAG AGAGAGAGAG AGAGAGCTTG TGTCTGAAAG 60 
783 

784 AATCCACAAA TGGCATTGAA GCTTAACCCT TTGGCATCTC AGCCTTACAA CTTCCCTTCC 120 
785 

786 TCGGCTCGTC CGCCAATCTC TACTTTCAGA TCTCCCAAGT TCCTCTGCCT CGCTTCTTCT 180 
787 

788 TCTCCCGCTC TCAGCTCCAA GGAGGTTGAG AGTTTGAAGA AGCCATTCAC ACCACCTAAG 240 
789 

790 GAAGTGCACG TTCAAGTCCT GCATTCCATG CCACCCCAGA AGATCGAGAT CTTCAAATCC 300 
791 

792 ATGGAAGACT GGGCCGAGCA GAACCTTCTA ACTCAGCTCA AAGACGTGGA GAAGTCGTGG 360 
793 

794 CAGCCCCAGG ACTTCTTACC CGACCCTGCA TCCGATGGGT TCGAAGATCA GGTTAGAGAG 420 
795 


SEQ ID NO: 18: 

AGA GCT TGG ACT GCA GAA GAG AAC CGA 48 
Arg Ala Trp Thr Ala Glu Glu Asn Arg 
10 15 


TAT CTT TAC TTG TCT GGA CGT GTT GAC 96 
Tyr Leu Tyr Leu Ser Gly Arg Val Asp 
25 30 

ATT CAG TAC TTG ATT GGT TCT GGA ATG 144 
He Gin Tyr Leu He Gly Ser Gly MET 
40 45 

CCT TAC CTC GG 176 
Pro Tyr Leu 
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796 CTAAGAGAGA GGGCAAGAGA GCTCCCTGAT GATTACTTCG TTGTTCTGGT GGGAGACATG 480 
797 

798 ATCACGGAAG AGGCGCTTCC GACCTATCAA ACCATGTTGA ACACTTTGGA TGGAGTGAGG 540 
799 

800 GATGAAACTG GCGCTAGCCC CACTTCATGG GCTATTTGGA CAAGAGCTTG GACTGCAGAA 600 
801 

802 GAGAACCGAC ACGGTGATCT TCTCAATAAG TATCTTTACT TGTCTGGACG TGTTGACATG 660 
803 

804 AGGCAGATTG AAAAGACCAT TCAGTACTTG ATTGGTTCTG GAATGGATCC TAGAACAGAG 720 
805 

806 AACAATCCTT ACCTCGGCTT CATCTACACT TCATTCCAAG AAAGAGCCAC CTTCATCTCT 780 
807 

808 CACGGAAACA CAGCTCGCCA AGCCAAAGAG CACGGAGACC ICAAGCTAGC CCAAATCTGC 840 
809 

810 GGCACAATAG CTGCAGACGA GAAGCGTCAT GAGACAGCTT ACACCAAGAT AGTTGAGAAG 900 
811 

812 CTCTTTGAGA TTGATCCTGA TGGTACTGTG ATGGCGTTTG CAGACATGAT GAGGAAGAAA 960 
813 

814 ATCTCGATGC CTGCTCACTT GATGTACGAT GGGCGGGATG AAAGCCTCTT TGACAACTTC 1020 
815 

816 TCTTCTGTTG CTCAGAGGCT CGGTGTTTAC ACTGCCAAAG ACTATGCGGA CATTCTTGAG 1080 
817 

818 TTTTTGGTTG GGAGGTGGAA GATTGAGAGC TTGACCGGGC TTTCAGGTGA AGGAAACAAA 1140 
819 

820 GCGCAAGAGT ACTTGTGTGG GTTGACTCCA AGAATCAGGA GGTTGGATGA GAGAGCTCAA 1200 
821 

822 GCAAGAGCCA AGAAAGGACC CAAGGTTCCT TTCAGCTGGA TACATGACAG AGAAGTGCAG 1260 
823 

824 CTCTAAAAAG GAACAAAGCT ATGAAACCTT TTCACTCTCC GTCGTCCCTC ATTTGATCTA 1320 
825 

826 TCTGCTCTTG AAATTGGTGT AGATTACTAT GGTTTGTGAT ATTGTTCGTG GGTCTAGTTA 1380 
827 

828 CAAAGTTGAG AAGCAGTGAT TTAGTAGCTT TGTTGTTTCC AGTCTTTAAA TGTTTTTGTG 1440 
829 

830 TTTGGTCCTT TTAGTAAACT TGTTGTAGTT AAATCAGTTG AACTGTTTGG TCTGT 1495 

831 

832 

833 (2) INFORMATION FOR SEQ ID NO: 20: 
834 

835 (i) SEQUENCE CHARACTERISTICS: 

836 

837 (A) LENGTH: 398 amino acids 

838 

839 (B) TYPE: amino acid 

840 

841 (D) TOPOLOGY: linear 

842 

843 (ii) MOLECULE TYPE: protein 

844 

845 

846 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
847 

848 MET Ala Leu Lys Leu Asn Pro Leu Ala Ser Gin Pro Tyr Asn Phe Pro 
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849 1 5 10 15 

850 

851 Ser Ser Ala Arg Pro Pro lie Ser Thr Phe Arg Ser Pro Lys Phe Leu 

852 20 25 30 
853 

854 Cys Leu Ala Ser Ser Ser Pro Ala Leu Ser Ser Lys Glu Val Olu Ser 

855 35 40 45 
856 

857 Leu Lys Lys Pro Phe Thr Pro Pro Lys Glu Val His Val Gin Val Leu 

858 50 55 60 
859 

860 His Ser MET Pro Pro Gin Lys He Glu He Phe Lys Ser MET Glu Asp 

861 65 70 75 80 

. 862 

863 Trp Ala Glu Gin Asn Leu Leu Thr Gin Leu Lys Asp Val Glu Lys Ser 

864 85 90 95 
865 

866 Trp Gin Pro Gin Asp Phe Leu Pro Asp Pro Ala Ser Asp Gly Phe Glu 

867 100 105 110 
868 

869 Asp Gin Val Arg Glu Leu Arg Glu Arg Ala Arg Glu Leu Pro Asp Asp 

870 115 120 125 
871 

872 Tyr Phe Val Val Leu Val Gly Asp MET He Thr Glu Glu Ala Leu Pro 

873 130 135 140 
874 

875 Thr Tyr Gin Thr MET Leu Asn Thr Leu Asp Gly Val Arg Asp Glu Thr 

876 145 150 155 160 
877 

878 Gly Ala Ser Pro Thr Ser Trp Ala He Trp Thr Arg Ala Trp Thr Ala 

879 165 170 175 
880 

881 Glu Glu Asn Arg His Gly Asp Leu Leu Asn Lys Tyr Leu Tyr Leu Ser 

882 180 185 190 
883 

884 Gly Arg Val Asp MET Arg Gin He Glu Lys Thr He Gin Tyr Leu He 

885 195 200 205 
886 

887 Gly Ser Gly MET Asp Pro Arg Thr Glu Asn Asn Pro Tyr Leu Gly Phe 

888 210 215 220 
889 

890 He Tyr Thr Ser Phe Gin Glu Arg Ala Thr Phe He Ser His Gly Asn 

891 225 230 235 240 
892 

893 Thr Ala Arg Gin Ala Lys Glu His Gly Asp Leu Lys Leu Ala Gin He 

894 245 250 255 
895 

896 Cys Gly Thr He Ala Ala Asp Glu Lys Arg His Glu Thr Ala Tyr Thr 

897 260 265 270 
898 

899 Lys He Val Glu Lys Leu Phe Glu He Asp Pro Asp Gly Thr Val Met 

900 275 280 285 
901 
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902 Ala Phe Ala Asp MET MET Arg Lys Lys lie Ser Met Pro Ala His Leu 

903 290 295 300 
904 

905 Met Tyr Asp Gly Arg Asp Olu Ser Leu Phe Asp Asa Phe Ser Ser Val 

906 305 310 315 320 
907 

908 Ala Gin Arg Leu Gly Val Tyr Thr Ala Lys Asp Tyr Ala Asp lie Leu 

909 325 330 335 
910 

911 Glu Phe Leu Val Gly Arg Trp Lys lie Glu Ser Leu Thr Gly Leu Ser 

912 340 345 350 
913 

914 Gly Glu Gly Asn Lys Ala Gin Glu Tyr Leu Cys Gly Leu Thr Pro Arg 

915 355 360 365 
916 

917 lie Arg Arg Leu Asp Glu Arg Ala Gin Ala Arg Ala Lys Lys Gly Pro 

918 370 375 380 
919 

920 Lys Val Pro Phe Ser Trp lie His Asp Arg Glu Val Gin Leu 

921 385 390 395 
922 

923 

924 (2) INFORMATION FOR SEQ ID N0:21: 
925 

926 (i) SEQUENCE CHARACTERISTICS: 

927 

928 (A) LENGTH: 29 base pairs 

929 

930 (B) TYPE: nucleic acid 

931 

932 (C) STRANDEDNESS: single 

933 

934 (D) TOPOLOGY: linear 

935 

936 (ii) MOLECULE TYPE: other nucleic acid 
937 

938 (A) DESCRIPTION: synthetic oligonucleotide mixture 

939 

940 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
941 

942 GCTAAGCTTA ARGARATHCC AGAYGAYTA 29 

943 

944 

945 (2) INFORMATION FOR SEQ ID NO: 22: 
946 

947 (i) SEQUENCE CHARACTERISTICS: 

948 

949 (A) LENGTH: 29 base pairs 

950 

951 (B) TYPE: nucleic acid 

952 

953 (C) STRANDEDNESS: single 

954 
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955 
956 
957 
958 
959 
960 
961 
962 
963 
964 
965 
966 
967 
968 
969 
970 
971 
972 
973 
974 
975 
976 
977 
978 
979 
980 
981 
982 
983 
984 
985 
986 
987 
988 
989 
990 
991 
992 
993 
994 
995 
996 
997 
998 
99? 
1000 
1001 
1002 
1003 
1004 
1005 
1006 
1007 


(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide mixture 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GCTAAGCTTA ARGARATHCC GGAYGAYTA 29 

(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide mixture 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GCTAAGCTTA ARGARATHCC CGAYGAYTA 29 

(2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide mixture 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GCTAAGCTTA ARGARATHCC TGAYGAYTA 29 
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1008 

1009 (2) INFORMATION FOR SEQ ID NO: 25: 
1010 

1011 (i) SEQUENCE CHARACTERISTICS: 

1012 

1013 (A) LENGTH: 26 base pairs 

1014 

1015 (B) TYPE: nucleic acid 

1016 

1017 (C) STRANDEDNESS: single 

1018 

1019 (D) TOPOLOGY: linear 

1020 

1021 (ii) MOLECULE TYPE: other nucleic acid 
1022 

1023 (A) DESCRIPTION: synthetic oligonucleotide mixture 

1024 

1025 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
1026 

1027 AGCGAATTCG TRTTNAGCAT NGTYTG 26 

1028 

1029 

1030 (2) INFORMATION FOR SEQ ID NO: 26: 
1031 

1032 (i) SEQUENCE CHARACTERISTICS: 

1033 

1034 (A) LENGTH: 26 base pairs 

1035 

1036 (B) TYPE: nucleic acid 

1037 

1038 (C) STRANDEDNESS: single 

1039 

1040 (D) TOPOLOGY: linear 

1041 

1042 (ii) MOLECULE TYPE: other nucleic acid 
1043 

1044 (A) DESCRIPTION: synthetic oligonucleotide mixture 

1045 

1046 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
1047 

1048 AGCGAATTCG TRTTYAACAT NGTYTG 26 

1049 

1050 

1051 (2) INFORMATION FOR SEQ ID NO: 27: 
1052 

1053 (i) SEQUENCE CHARACTERISTICS: 

1054 

1055 (A) LENGTH: 3440 base pairs 

1056 

1057 (B) TYPE: nucleic acid 

1058 

1059 (C) STRANDEDNESS: double 

1060 
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1061 
1062 
1063 
1064 
1065 
1066 
1067 
1068 
1069 
1070 
1071 
1072 
1073 
1074 
1075 
1076 
1077 
1078 
1079 
1080 
1081 
1082 
1083 
1084 
1085 
1086 
1087 
1088 
1089 
1090 
1091 
1092 
1093 
1094 
1095 
1096 
1097 
1098 
1099 
1100 
1101 
1102 
1103 
1104 
1105 
1106 
1107 
1108 
1109 
1110 
1111 
1112 
1113 


(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TCTAGAATTC TCTAATTACG TCTGTTTGTT CTATTTTTTA TATGATATCA 
ATAAATATAT GGTTTAAGAT GCCAAAAAAT TATTTACTTG GTGAATATAA 
ATTAGAAATA CATCATTTAG TTAAATAAAT AACCAAAAAC CAAAAATTCA 
GGCGCGCGGT CAGGGTCTCG TTAGTTTTAA AATCAATGCA GTTTACAATT 
TGAAAATAAG TATAATTTGT ATTGAAATTA TAAAGTGACA TTTTTTGTGT 
TTGTGTAACA AGAATTAAAA AAAAAAACAG AAAATACTCA GCTTTTTTAA 
AATTAATTGA GTTAGAAAAT TGTTGTACCA ATAACAAAAG ATTTATATGG 
TCAACACACC AATAACACAA GACTTTTTAA AAATTTAAGA ATAATATAAG 
AGAATCTTCA AATTCTTCAA ATCCTTAAAA ATCAATCTCC CACTATTAAT 
TTTAGTTGGT AATGGCAACG TTTGTTGACT ACCGTATTGT AACTTTTGTC 
AAATACGTGT CAAACTCTGG TAAAAAATTA GTCTGCTACA TCTGTCTTTT 
CACAGCTGTT AATCAGAATT TGGTTTATTA AATCAACAAC CTGCACGAAA 
CATATTTTGT CTGTTTCTGG TTCATGACCT TCTTCCGCAT GATGGCCAAG 
ACTTGCAAGA GCGTTTCTTC AACGAGATAA GTCGAACAAA TATTTGTCCG 
ATATAAAATC TCCCCATCTC TATATATAAT ACCAGCATTC ACCATCATGA 
TCCCAATCTC ACAAATACTT CAATAAAAAG ACCAAAAAAA ATTAAAGCAA 
TCTTGTGCAC AAAAAAAAAA GAAGCCTTCT AGGTTTTCAC GACATGAAGT 
AATGGTCATC ACATTGGTGA TAATCGCCAT CTCGTCTCCT GTTCCAATTA 
GGTTGAAAGT TTCGGAGAAG TGGCACAATC GTGTGTTGTG ACAGAACTCG 
ACCAGCAATG ACCACGGCAG GAGACCCGAC TACAGAATGC TGCGACAAAC 
GAAACCATGT CTTTGTGGTT ATATTCGAAA CCCAGCCTAT AGTATGTATG 
AAACGGTCGC AAAGTCTTAG ATTTTTGTAA GGTTCCTTTT CCTAGTTGTT 
AGACATTGCT AAGAAAAATA TTATTAAAAA TAAAAGAATC AAACTAGATC 


AATATTCGTC 60 

TACGTTAAAT 120 

TATCCGCGCT 180 

AATTTCCAGC 240 

AACAAATATT 300 

TAATAAAAAA 360 

AATTATAAAA 420 

CAATAACAAT 480 

CCCCCTTAGT 540 

AAATTGTCAT 600 

ATTTATAAAA 660 

CTTGTGTGAG 720 

TGTAATGGCC 780 

TTACGACCAC 840 

ATACCTCAAA 900 

AGAAAAGCCT 960 

TCACTACTCT 1020 

GAGCAACCAC 1080 

CCCCATGCTT 1140 

TGGTAGAGCA 1200 

TTACTTCTCC 1260 

AAATCTCTCA 1320 

TGATGTAACA 1380 
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1114 ATGAATCATC ATGTTATGGT TGAAGCTTAT ATGCTGAAGT GTTTGATTTT ATATATGTGT 1440 
1115 

1116 GTGTGTGTGT CCTGCTCAAT TTTTGAAACA CACACGTTTC TCCTGATTTG GATTTAAATT 1500 
1117 

1118 ATATTTTGAG TTAAAAAAAA GAAAAAGATG GAATGCTATT TATACAAGTT GATGAAAAAG 1560 
1119 

1120 TGGAAGTACA ATTTAGATAT CTCCTACACT TAAAGAATGA AACAATAATA GACTTACGAA 1620 
1121 

1122 ACAAATGAAA AATACATAAA TTGTCGACAA TCAACGTCCG ATGACGAGTT TATTATTAAA 1680 
1123 

1124 AATTTGTGTG AAGGACTAGC AGTTCAACCA AATGATATTG AACATATACA TCAACAAATA 1740 
1125 

1126 TGATAATCAT AAAAGAGAGA ATGGGGGGGG GGTGTCGTTT ACCAGAAACC TCTTTTTCTC 1800 
1127 

1128 AGCTCGCTAA AACCCTACCA CTAGAGACCT AGCTCTGACC GTCGGCTCAT CGGTGCCGGA 1860 
1129 

1130 GGTGTAACCT TTCTTTCCCA TGACCCGAAA CCTCTCTTTC CCAACTCACG AAAACCCTAC 1920 
1131 

1132 AATCAAAAAC CTAGCTCCGA CCGTCGGCTC ATCGGTGCCG AAGGTGTAAC CTTTCTCTCC 1980 
1133 

1134 CATCATAGTT TCTCGTAAAT GAAAGCTAAT TGGGCAATCG ATTTTTTAAT GTTTAAACCA 2040 
1135 

1136 TGCCAAGCCA TTTCTTATAG GACAATTGTC AATAATAGCA TCTTTTGAGT TTTGTCTCAA 2100 
1137 

1138 AAGTGACACT AGAAGAAAAA AGTCACAAAA ATGACATTCA TTAAAAAGTA AAATATCCCT 2160 
1139 

1140 AATACCTTTG GTTTAAATTA AATAAGTAAA CAAAAATAAA TAAAAACAAA TAAAATAAAA 2220 
1141 

1142 ATAAAAAATG AAAAAAAGAA ATTTTTTTAT AGTTTCAGAT TATATGTTTT CAGATTCGAA 2280 
1143 

1144 ATTTTTTAAA TTCCCTTTTT TAAATTTTCT TTTTTGAAAT TTTTTTTTTT GAAATTTTTT 2340 
1145 

1146 GAAACTGTTT TTAAAATTTT TATTTTTAAT TTTTTAGTAT TTATTTTTTA TTTTATAAAA 2400 
1147 

1148 TTTTAAACGC TAATTCCAAA ACTCCCCCCC CCCCCCCCCC CCCAATTCTC TCCTAGTCTT 2460 
1149 

1150 TTTCTCTTTC TTATATTTGG GCTTCTATCT TCTCTTTTTT TTTCAGGCCC AAAGTATCAT 2520 
1151 

1152 GTGTAACAAC CGGTGTTCAA AAACGCGCCC GCCTGGCCGT TTACTCGCCC GATTAAATGA 2580 
1153 

1154 TGATCGGAAG GCTGCCATGG CGAGGCGGAG GTAATCAGTG GTTCTAGGCG CTGAAACTAG 2640 
1155 

1156 AAAACCTTCA AAAATCGAAA TTTTAAGAGC TAAATCGGTG TTTATCTCAT GAATCTATTA 2700 
1157 

1158 TATTTAGTTG AAACTCACAA GAATCGGTTG TAAAAACTAT GAAATCGIGC AAAAAAAATG 2760 
1159 

1160 AAGAACAAAA TATTCTCAGA TCTGGAAAAC ACAGAGAAGA GGTTGAAGAT GAGGGTAAAA 2820 
1161 

1162 TCGTATTTTG TCATTCATTA AACTAAAATC AAAAAAAAAT GATGCAAAAT TCAATGATAA 2880 
1163 

1164 TAACTCGAAC TCGCAACCAT ATGCATCTTT AGACTGCGAC ACGGACCACT AGACTAAGCA 2940 
1165 

1166 ATTTTAATGT TTATTCATCA CAGACCTAAT ATATGTCTAA AACTAGGCGC CGAGIACGCC 3000 
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1167 

1168 CCGCTTAATC CCGAGTTTTT GTTAGCTCGC TAGACCCAGG GTCACCGCCC GACTAACGAG 3060 
1169 

1170 TAGCGTAATT CTGAACTGGG GTAACAACAT AGAGAACATC GCCGACCCTT CCCTGCCGAT 3120 
1171 

1172 GATGCCGCCT CCGATGAACT TCCTGTAACG CCTTCAGTTT CCATTGATTT TCCCCTTTAA 3180 
1173 

1174 TCTGATCAGT TCCATGTTTT ATCCAACTCA TCCGACTCCG TAGCATTTAA TCGATCTCAT 3240 
1175 

1176 CATTTACATA CATAACCAGT AGGAGGTCTC ATATAAATTT GAACGTTTCC AGCGATGAAC 3300 
1177 

1178 AGTGCCAATC TCTGCGAAAT CCATTTCTCT AAGCTCAGGG CTGGCGGCTG CAGCCCGGGG 3360 
1179 

1180 ATCCACTAGT TCTAGGCGGC CGCACCGCGG TGGAGCTCCA ATTCGCCCTA TAGTGAG1CG 3420 
1181 

1182 TATTACGCGC GCTCACTGGC 3440 

1183 

1184 

1185 (2) INFORMATION FOR SEQ ID NO: 28: 
1186 

1187 (i) SEQUENCE CHARACTERISTICS: 

1188 

1189 (A) LENGTH: 3898 base pairs 

1190 

1191 (B) TYPE: nucleic acid 

1192 

1193 (C) STRANDEDNESS: double 

1194 

1195 (D) TOPOLOGY: linear 

1196 

1197 (ii) MOLECULE TYPE: genomic DNA 

1198 

1199 

1200 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
1201 

1202 CTCGAGAGCT GAAGGATTTT TTGTTAGAGA TTCAACGACA GATGGACCCT TCCTCCACTA 60 
1203 

1204 GGCAACTGCA AGAACCTAAC AATGCAAATA TCACTCCTCC TCAGCCTTCA AGGAGCGTTA 120 
1205 

1206 ATAGGACTGG AACAAGCGGT CAAGTGAGTA AATTTTCCTT CCAAGATAGA TCTCTATGGT 180 
1207 

1208 TCGGTTCATG AAGTTTGTGG TTTAATTGTG TAGCAACAGG ATAGTGCAAG TGAGAATAGA 240 
1209 

1210 GTTCGACCTC ATCTACCTAC CCCGGAACCT CTGAATGTAT CCCCATTGAA GAAGAAGAGG 300 
1211 

1212 GCAAATCCTG CACCCAGAAG GATAAAGAAA TTTTGGACGC CTGAAGAAGT GGCAGTTCTG 360 
1213 

1214 AGGGAAGGAG TAAAAGAGTA TGTCTACTAC TACTACTCTA TAATCAAGTT TCAAGAAGCT 420 
1215 

1216 GAGCTTGGCT CTCACTTTAT ATGTTTGATG TTGTTGTGCA GGTATGGTAA ATCATGGAAA 480 
1217 

1218 GAGATAAAGA ATGCAAACCC TGAAGTATTG GCAGAGAGGA CTGAGGTGAG AGAGCATGTC 540 
1219 


Page: 24 Raw Sequence Listing 05/21/92 

15 1 17 J 33 

Patent Application US/07/762,762 

1220 ACTTTTGTGT TACTCATCTG AATTATCTTA TATGCGAATT GTAAGTGGTA CTAAAAGGTT 600 
1221 

1222 TGTAACTTTT GGTAGGTGGA TTTGAAGGAT AAATGGAGGA ACTTGCTTCG GTAGCGGTAA 660 
1223 

1224 CAAGTTTTAT ATTGCTATGA AGTTTTTTTG CCTGCGTGAC GTATCAGCAG CTGTGGAGAA 720 
1225 

1226 GATGGTATTA GAAAGGGTCT TTTCACATTT TGTGTTGTGA CAAATATTAA TTCGGCCGGT 780 
1227 

1228 ATGGTTTGGT TAAGACTTGT TGAGAGACGT GTGGGGTTTT TTGATGTATA ATTAGTCTGT 840 
1229 

1230 GTTTAGAACG AAACAAGACT TGTTGCGTAT GCTTTTTTTA ACTTGAGGGG GTTTGTTGTT 900 
1231 

1232 GTTAGTTAGG AACTTGACTT TGTCTCTTTC TCICAAGATC TGATTGGTAA GGTCTGGGTG 960 
1233 

1234 GTAGTACTGT TTGGTTTAAT TTGTTTTGAC TATTGAGTCA CTGTGGCCCA TTGACTTTAA 1020 
1235 

1236 ATTAGGCTGG TATATTTTTT GGTTTAAAAC CGGTCTGAGA TAGTGCAATT TCGATTGAGT 1080 
1237 

1238 CAATTTTAAA TTCTTCAAGG TAATGGGCTG AATACTTGTA TAGTTTTAAG ACTTAACAGG 1140 
1239 

1240 CCTTAAAAGG CCCATGTTAT CATAAAACGT CATTGTTTAG AGTGCACCAA GCTTATAAAA 1200 
1241 

1242 TGTAGCCAGG CCTTAAAAGA CTTAACAGGC CTIAAAAGAC TTAACATTCC TTAAAAGGCC 1260 
1243 

1244 CATGTTATGA TAAAACGTCA TCGTTTTGAG TGCACCAAGC TAAATGTAGC CAGGCCTTAA 1320 
1245 

1246 AAGACTTAAC AGGCCTTAAA AGGCCCATGT TAICATAAAA CGCCGTCGTT TTGAGTGCAC 1380 
1247 

1248 CAAGCTTATA AATGTAGCCA GCTACCTCGG GACATCACGC TCTTTGTACA CTCCGCCATC 1440 
1249 

1250 TCTCTCTCTC TCGAGCAGAT CTCTCTCGGG AATATCGACA ATGTCGACCA CTTTCTGCTC 1500 
1251 

1252 TTCCGTCTCC ATGCAAGCCA CTTCTCTGGT AATCTCATCT CCTTCTTGTG TTCCCAGATC 1560 
1253 

1254 GCTCTGATCA TACTTTCTTT TAGATCATTT GCCTCTGATC TGTTGCTTGA TGTTTGTTAA 1620 
1255 

1256 CTCTCCACGC ATGTTTGATT ATGTTGAGAA TTAGAAAAAA AATGTTAGCT TTACGAATCT 1680 
1257 

1258 TTAGTGATCA TTTCAATTGG ATTTGCAATC TTGTGTGACA TTTGAGGCTT GTGTAGATTT 1740 
1259 

1260 CGATCTGTAT TCATTTTGAA TCACAGCTAT AATAGTCATT TGAGTAGTAG TGTTTTTAAA 1800 
1261 

1262 TGAACATGTT TTGTTGTATT GATGGAACAA ACAGGCAGCA ACAACGAGGA TTAGTTTCCA 1860 
1263 

1264 GAAGCCAGCT TTGGTTTCAA CGACTAATCT CTCCTTCAAC CTCCGCCGTT CAATCCCCAC 1920 
1265 

1266 TCGTTTCTCA ATCTCCTGCG CGGTATGTTC TCATTCTCAG CATTTATTTC GAGCTTGCTT 1980 
1267 

1268 GTCATGGTAC TCTCTCTAAT TGTCTATTTG GTTTATTAGG CCAAACCAGA GACGGTTGAG 2040 
1269 

1270 AAAGTGTCTA AGATAGTTAA GAAGCAGCIA TCACTCAAAG ACGACCAAAA GGTCGTTGCG 2100 
1271 

1272 GAGACCAAGT TTGCTGATCT TGGAGGAGAT TCTCTCGACA CTGTAAGTCA TCAATCATTC 2160 
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1273 

1274 TCXTATGTGA ATAAA6AOAA CTTGAAGAGT TTGTTTTTAA CATATTAACT GAOTGTTTTG 2220 
1275 

1276 CATGCAGGTT GAGATAGTGA TGGGTTTAGA GGAAGAGTTT GATATCGAAA TGGCTGAAGA 2280 
1277 

1278 GAAAGCTCAG AAGATTGCTA CTGTGGAGGA AGCTGCTGAA CTCATTGAAG AGCTCGTTCA 2340 
1279 

1280 ACTTAAGAAG TAATTTTAGT ATTAAGAGCA GCCAAGGCTT TGTTGGGTTT GTTGTTTTCA 2400 
1281 

1282 TAATCTTCCT GTCATTTTCT TTTTCTTTAA TGTGTCAAGC GACTCTGTTG GTTTAAAGTA 2460 
1283 

1284 GTATCTGTTT GCGATGGATC TCTCTCTATT TGTCGACTGA AAACTTTTGG TTTACACATG 2520 
1285 

1286 AAAGCTTGTT CTTGTTCTTT CXTAAATCGA AATGCCAAAT GCGAGATTAG GGAATCTTGT 2580 
1287 

1288 ATTAACACAT ACATAAGTCA AAGAGTAGGC CCTAAGATGA CAATTTATAA ACAATCCTAT 2640 
1289 

1290 TCACATTGTA TATACAGGTT ATGATTATTC CCAATCAGCG TCAAAGAATC CAGCATCTTT 2700 
1291 

1292 CATCTCTGAA TAGTAGACAT TCTCCAAGTT CACATCTTCC TCCTGCACCA AAAACCAGTA 2760 
1293 

1294 CTAAATCATG AACATTGCAA TAATCACATG CCTAGGCGAG AGTTTTGGTG ATGTGGTGTT 2820 
/ 1295 

1296 AGTGATAGTG ATACTGATGG TGCTAGAGCG GTTAAGAAGG ATTAACCTGG AAGAAGTCTG 2880 
1297 

1298 CAAGGAAAGT AACATAGAGA AGAGGAAGAT AGGAGTGGTA ACAAACACTT GTGATCCCAT 2940 
1299 

1300 ACAGCCTCCC AGCATTTTTC AAATGTTATT TCCTTACATA AAGAAACAAG AGAAGTCTGA 3000 
1301 

1302 CTAGATGATA TTTATATAGG ATAAGTGTTT TACCATAAGC CAAAGTGAGC GCCGTTTGCA 3060 
1303 

1304 AGAGCTAACC AGACAGTACA CGTTTGGCAT ATATCTCATC AACATGATCT GAAAAGTAAC 3120 
1305 

1306 ATATCACAGT TAATGAACAC AATGGTTACC TTGAGAAGCA AATCAAGACC TATAACAAGC 3180 
1307 

1308 CCAGAGATGA GGAAAGTCCG TGTCAACGCT TCACCGCCAT TCGCGTAGTT TCCTTGGAAG 3240 
1309 

1310 ACAAAGGCCA CCAACCAAAC TTACTTCCAG AAACAACACT CCAAATGTTG TCAACAAAGT 3300 
1311 

1312 CAATAGATTC CAAACTACTT CGTTACAGGG TTGTATAGAT AATATAATAG AATAGIGGGA 3360 
1313 

1314 AGATAGTATA AATAAAATAA ATAAAAGATC CTATCGGTAA ATAGTTTATA ATATCGGGGG 3420 
1315 

1316 CGTATATAAA GTATAAAAGA AACTCTTCTC CAATCCGACC GTTGAAAATC ACTCTCAATC 3480 
1317 

1318 TCTGGCGTAA CGACCGGATC GTTCGCGCGT AATTTTCGCT GCTATAAATA GAAACTTTCC 3540 
1319 

1320 TCTTCTGTTT CTCGATCAAA ATTTTTTTTT GGAAAAATTA AGTTTGAATC TATCGTAGAT 3600 
1321 

1322 GCTGTGACAA AAAAAAATTG TTTTATCGAA GATGAGAAAC ATGAGGCCTG TTCATGCAAG 3660 
1323 

1324 GAACCAGACC ACGGATCCAT CTXCGCCGAT GATGACGTCT CCTCTGATGA ATCGTCACGC 3720 
1325 
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1326 ACGGACAGGA TCCAACGCTG GACGAGCATC TAAC6CCAA6 AAA6CACA6A CGAAAGCAGC 3780 
1327 

1328 AGCTCAGAGA CTCGCGGCTG TGATGTCGAA CCAAAGAGGC GACGATGAAG ACAGTGATGA 3840 
1329 

1330 TGACCTTTCC TTTGACTACA ACGCTGTCGG AAGCATTGGT CTCGCTGCCG GAAGATCT 3898 

1331 

1332 

1333 (2) INFORMATION FOR SEQ ID NO: 29: 
1334 

1335 (i) SEQUENCE CHARACTERISTICS: 

1336 

1337 (A) LENGTH: 4325 base pairs 

1338 

1339 (B) TYPE: nucleic acid 

1340 

1341 (C) STRANDEDNESS: double 

1342 

1343 (D) TOPOLOGY: linear 

1344 

1345 (ii) MOLECULE TYPE: genomic DNA 

1346 

1347 

1348 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
1349 

1350 CTCGAGGCAG TCACTAACAT GAAGTTTGAC GAGGAGCCCA ACTATGGGAA GCTTATTTCT 60 
1351 

1352 CTTTTCGATA CTCTAATTGA GCCGTGCGCT CTATCTAGAC CAATTAGAAT TGATGGAGCT 120 
1353 

1354 CTAAAGGTTG CTGGCTGTTT TCTTGTTCAT ATGATTAACT TCTAAACTTG TGTATAAATA 180 
1355 

1356 TTCTCTGAAA GTGCTTCTTT TGGCATATGT AGGTTGGGCA AAAACGAGGA AGATTGCTTC 240 
1357 

1358 TCAATTTGGA AGAGGATGAA CAGCCGAAGA AGAAAATAAG AATAGGCAGT CCTGCTACTC 300 
1359 

1360 AATGGATCTC AGTCTATAAC GGTCGTCGTC CCATGAAACA GAGGTAAAAC ATTTTTTGCA 360 
1361 

1362 TATACACTTT GAAAGTTCCT CACTAACTGT GTAATCTTTT GGTAGATATC ACTACAATGT 420 
1363 

1364 CGGAGAGACA ANGGCTGSNC ANCATATACA AAAGGGAAAT GAAGATGGCC TTTTGATTAG 480 
1365 

1366 CTGTGTAGCA TCAGCAGCTA ATCTCTGGGC TCTCATCATG GATGCTGGAA CTGGATTCAC 540 
1367 

1368 TTCTCAAGTT TATGAGTTGT CACCGGTCTT CCTACACAAG GTAATAATCA GTTGAAGCAA 600 
1369 

1370 TTAAGAATCA ATTTGATTTG TAGTAAACTA AGAAGAACTT ACCTTATGTT TTCCCCGCAG 660 
1371 

1372 GACTGGATTA TGGAACAATG GGAAAAGAAC TACTATATAA GCTCCATAGC TGGTTCAGAT 720 
1373 

1374 AACGGGAGCT CTTTAGTTGT TATGTCAAAA GGTTAGTGTT TAGTGAATAA TAAACTTATA 780 
1375 

1376 CCACAAAGTC TTCATTGACT TATTTATATA CTTGTTGTGA ATTGCTAGGA ACTACTTATT 840 
1377 

1378 CTCAGCAGTC ATACAAAGTG AGTGACTCAT TTCCGTTCAA GTGGATAAAT AAGAAATGGA 900 
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1379 

1380 AAGAAGATTT TCATGTAACC TCCATGACAA CTGCTGGTAA TCGTTGGGGT GTGGTAATGT 960 
1381 

1382 CGAGGAACTC TGGCTTCTCT GATCAGGTAG GTTTTTGTCT CTTATTGTCT GGTGTTTTTA 1020 
1383 

1384 TTTTCCCCTG ATAGTCTAAT ATGATAAACT CTGCGTTGTG AAAGGIGGTG GAGCTTGACI 1080 
1385 

1386 TTTTGTACCC AAGCGATGGG ATACATAGGA GGTGGGAGAA TGGGTATAGA ATAACATCAA 1140 
1387 

1388 TGGCAGCAAC TGCGGATCAA GCAGCTTTCA TATTAAGCAT ACCAAAGCGT AAGATGGTGG 1200 
1389 

1390 ATGAAACTCA AGAGACTCTC CGCACCACCG CCTTTCCAAG TACTCATGTC AAGGTTGGTT 1260 
1391 

1392 TCTTTAGCTT TGAACACAGA TTTGGATCTT TTTGTTTTGT TTCCATATAC TTAGGACCTG 1320 
1393 

1394 AGAGCTTTTG GTTGATTTTT TTTTCAGGAC AAATGGGCGA AGAATCTGTA CATTGCATCA 1380 
1395 

1396 ATATGCTATG GCAGGACAGT GTGCTGATAC ACACTTAAGC ATCATGTGGA AAGCCAAAGA 1440 
1397 

1398 CAATTGGAGC GAGACTCAGG GTCGTCATAA TACCAATCAA AGACGTAAAA CCAGACGCAA 1500 
1399 

1400 CCTCTTTGGT TGAATGTAAT GAAAGGGATG TGTCTTGGTA TGTATGTACG AATAACAAAA 1560 
1401 

1402 GAGAAGATGG AATTAGTAGT AGAAATATTT GGGAGCTTTT TAAGCCCTTC AAGTGTGCTT 1620 
1403 

1404 TTTATCTTAT TGATATCATC CATTTGCGTT GTTTAATGCG TCTCTAGATA TGTTCCTATA 1680 
1405 

1406 TCTTTCTCAG TGTCTGATAA GTGAAATGTG AGAAAACCAT ACCAAACCAA AATATTCAAA 1740 
1407 

1408 TCTTATTTTT AATAATGTTG AATCACTCGG AGTTGCCACC TTCTGTGCCA ATTGTGCTGA 1800 
1409 

1410 ATCTATCACA CTAGAAAAAA ACATTTCTTC AAGGTAATGA CTTGTGGACT ATGTTCTGAA 1860 
1411 

1412 TTCTCATTAA GTTTTTATTT TCTGAAGTTT AAGTTTTTAC CTTCTGTTTT GAAATATATC 1920 
1413 

1414 GTTCATAAGA TGTCACGCGA GGACATGAGC TACACATCGC ACATAGCATG CAGATCAGGA 1980 
1415 

1416 CGATTTGTCA CTCACTTCAA ACACCTAAGA GCTTCTCTCT CACAGCGCAC ACACATATGC 2040 
1417 

1418 ATGCAATATT TACACGTGAT CGCCATGCAA ATCTCCATTC TCACCTATAA ATTAGAGCCT 2100 
1419 

1420 CGGCTTCACT CTTTACTCAA ACCAAAACTC ATCACTACAG AACATACACA AATGGCGAAC 2160 
1421 

1422 AAGCTCTTCC TCGTCTCGGC AACTCTCGCC TTGTTCTTCC TTCTCACCAA TGCCTCCGTC 2220 
1423 

1424 TACAGGACGG TTGTGGAAGT CGACGAAGAT GATGCCACAA ATCCAGCCGG CCCATTTAGG 2280 
1425 

1426 ATTCCAAAAT GTAGGAAGGA GTTTCAGCAA GCACAACACC TGAAAGCTTG CCAACAATGG 2340 
1427 

1428 CTCCACAAGC AGGGAATGCA GTCCGGTAGT GGTCCAAGCT GGACCCTCGA TGGTGAGTTT 2400 
1429 

1430 GATTTTGAAG ACGACGTGGA GAACCAACAA CAGGGCCCGC AGCAGAGGCC ACCGCTGCTC 2460 
1431 
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1432 
1433 
1434 
1435 
1436 
1437 
1438 
1439 
1440 
1441 
1442 
1443 
1444 
1445 
1446 
1447 
1448 
1449 
1450 
1451 
1452 
1453 
1454 
1455 
1456 
1457 
1458 
1459 
1460 
1461 
1462 
1463 
1464 
1465 
1466 
1467 
1468 
1469 
1470 
1471 
1472 
1473 
1474 
1475 
1476 
1477 
1478 
1479 
1480 
1481 
1482 
1483 
1484 


CAGCAGTGCT 
GGAGCATCCA 
CAGCAGATGC 
TGCAACATCA 
TACTAGATTC 
GTTGATGTAT 
AATAAGAACT 
TTCCTATAAA 
ACAATGAAGC 
AAGCCTTCAA 
CTGGTTTTGA 
TTTATTTGGT 
TTCAGTTTTA 
ATAGTTTTAT 
CCACATATCC 
ATTATGAGTT 
ACGTGATCTT 
AGTAAACTCT 
CAGCAAACAT 
TATTTCATTT 
GAGAAGTCAC 
AAAGAACAAA 
GGAGGAAGAG 
GGACGTTTTG 
GTGAAGTCAT 
GTTTTACCGA 
CIGTTTCCTG 


GCAACGAGCT 
AAGCCGTTAA 
AGCAAGTGAT 
GGCAAGTTAG 
CAAACGAATA 
GTTAACACTA 
ACTCCGTAGA 
GTGATGATTA 
AGTACTAGIC 
GGCTAGTGAT 
TCAGATACTT 
GAAGACTCTT 
AGATTTGTTA 
TTATATAATG 
AATATATTAA 
TTGTTATTTA 
CATTTCTTGG 
GTTCTTGGTT 
CGCAAACCAT 
GGTGAAAACC 
CACACCTCTC 
TAAAACCCGA 
AATGRCGGCG 
GTGGCGGCGG 
TGGTTCGTTT 
TAAAGCTTAA 
CTTATTAGTG 


CCACCAGGAA 
ACAACAGATT 
TAGCCGTATC 
CATTTGCCCC 
TCCTCGAGAG 
CATAGTCATG 
CGGTAATAAA 
ACAACAGATA 
TATTGAACAT 
GATAAAAGAT 
CAAAACTATT 
TAGAAGCAAA 
TTGACTTATT 
CTTGTCTATT 
GTTTCATTTC 
CCTGAAGAAA 
CTAAAAGCGA 
TTTGGTTTAA 
ATGTCAATTC 
CTAGAAGCCA 
CACTAAAACC 
AGATGAGACC 
GNSNTTTGGT 
TGGACCTTTG 
ACTCTTTTCT 
GACTTTATTG 
TTCCTTTGTT 


GAGCCACTTT 
CGACAACAAC 
TACCAGACCG 
TTCCAGAAGA 
TGTGTATACC 
GTGTGTGTTC 
AGAGAAGTTT 
CACCAAAAAG 
GTCAGATTTT 
CAT C CAATGG 
TTTGTATTCA 
GAACGACAAG 
GTCATTTGAA 
CAAGATTTGA 
TGTTCAAACA 
AGATAAGTGA 
ATATGACATC 
TCAAACCGAA 
GTTAGATTCC 
GCCANCCTTT 
CTGAACCTTA 
ACCACGTGCG 
GGCGGCGGCG 
GTGGTGGATA 
TAGTCGAATC 
ATAAAGTTCT 
TTGAGTTGAA 


GCGTTTGCCC 
AGGGACAACA 
CTACGCACTT 
CCATGCCTGG 
ACGGTGATAT 
CATAAATAAT 
TTTTTTTTAC 
AAAACAATTA 
CTTTTTCTAA 
GATCCAACAA 
TTAAATTATG 
CAGTAATAAA 
AAATATAGTA 
GAACATTAAT 
TATGATAAGA 
GCTTCGAGTT 
ACCTAGAGAA 
CCGGTAGCTG 
CGGTTTAAGT 
TTAATCTAAT 
CTGAGAGAAG 
GCGGGACGTT 
GACGTTTTGG 
TCGTGACGAA 
TTATTCTTGC 
CAGCTTTGAA 
TCACTGTCTT 


AACCTTGAAA 2520 
AATGCAGGGA 2580 
ACCTAGAGCT 2640 
GCCCGGCTTC 2700 
GAGTGTGGTT 2760 
GTACTAATGT 2820 
TCTTGCTACT 2880 
ATCTATATTC 2940 
ATGTCTAATT 3000 
AGACTCAAAT 3060 
CAAGTGTTCT 3120 
AAAAACAAAG 3180 
TGATATTAAT 3240 
ATGATACTGT 3300 
TGGTCAAATG 3360 
TCTGAAGGGT 3420 
AGCCGATAAT 3480 
AGTGTCAAGT 3540 
TGTAAACCGG 3600 
TTTTGCAAAC 3660 
CAGAGNCANN 3720 
CAGGGGACGG 3780 
TGGCGGCGGT 3840 
GGACCTCCCA 3900 
TCTGCTCGTT 3960 
TGTGAATGAA 4020 
AGCACTTTTG 4080 
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1485 
1486 
1487 
1488 
1489 
1490 
1491 
1492 
1493 
1494 
1495 
1496 
1497 
1498 
1499 
1500 
1501 
1502 
1503 
1504 
1505 
1506 
1507 
1508 
1509 
1510 
1511 
1512 
1513 
1514 
1515 
1516 
1517 
1518 
1519 
1520 
1521 
1522 
1523 
1524 
1525 
1526 
1527 
1528 
1529 
1530 
1531 
1532 
1533 
1534 
1535 
1536 
1537 


TTAGATTCAT CTTTGTGTTT AAGTTAAAAG GTAGAAACTT TGTGACTTGT CTCCGTTATG 4140 
ACAAGGTTAA CTTTGTTGGT TATAACAGAA GTTGCGACCT TTCTCCATGC TTGTGAGGGT 4200 
GATGCTGTGG ACCAAGCTCT CTGAGGCGAA GATCCCTTAC TTCAATGCCC CAATCTACTT 4260 
GGAAAACAAG ACACAGATTG GGAAAGTTGA TGAGATCCAA GCTTGGGCTG CAGGTCGACG 4320 
AATTC 4325 
(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CGGATCCACT GCAGTCTAGA GGGCCCGGGA 30 

(2) INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
AATTTCCCGG GCCCTCTAGA CTGCAGTGGA TCCGAGCT 38 
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1538 
1539 
1540 
1541 
1542 
1543 
1544 
1545 
1546 
1547 
1548 
1549 
1550 
1551 
1552 
1553 
1554 
1555 
1556 
1557 
1558 
1559 
1560 
1561 
1562 
1563 
1564 
1565 
1566 
1567 
1568 
1569 
1570 
1571 
1572 
1573 
1574 
1575 
1576 
1577 
1578 
1579 
1580 
1581 
1582 
1583 
1584 
1585 
1586 
1587 
1588 
1589 
1590 


(2) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
GTAAGTAGGT AGGGCTTCCT CTGTAATCAT ATCTCCAACC AAAACAACAA 50 

(2) INFORMATION FOR SEQ ID NO: 33: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CTTAAGAAGT AACCCGGGCT GCAGTTTTAG TATTAAGAG 39 

(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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1591 
1592 
1593 
1594 
1595 
1596 
1597 
1598 
1599 
1600 
1601 
1602 
1603 
1604 
1605 
1606 
1607 
1608 
1609 
1610 
1611 
1612 
1613 
1614 
1615 
1616 
1617 
1618 
1619 
1620 
1621 
1622 
1623 
1624 
1625 
1626 
1627 
1628 
1629 
1630 
1631 
1632 
1633 
1634 
1635 
1636 
1637 
1638 
1639 
1640 
1641 
1642 
1643 


(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GGAATTCGTC GACAGATCTC TGCAGCTCGA GGGATCCAAG CTT 43 

(2) INFORMATION FOR SEQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CCATTTTTGA TCTTCCTCGA GCCCGGGCTG CAGTTCTTCT TCTTCTTG 48 

(2) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
GCTCGTTTTT TTTTTCTCTG CAGCCCGGGC TCGAGTCACA GCTTCACC 48 
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1644 
1645 
1646 
1647 
1648 
1649 
1650 
1651 
1652 
1653 
1654 
1655 
1656 
1657 
1658 
1659 
1660 
1661 
1662 
1663 
1664 
1665 
1666 
1667 
1668 
1669 
1670 
1671 
1672 
1673 
1674 
1675 
1676 
1677 
1678 
1679 
1680 
1681 
1682 
1683 
1684 
1685 
1686 
1687 
1688 
1689 
1690 
1691 
1692 
1693 
1694 
1695 
1696 


(2) INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
ACTGACTGCA GCCCGGGCTC GAGGAAGATC AAAAATGGCT CTTC 44 

(2) INFORMATION FOR SEQ ID NO: 38: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GAGTAGTGAA CTTCATGGAT CCTCGAGGTC TTGAAAACCT AGA 43 

(2) INFORMATION FOR SEQ ID NO: 39: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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1697 
1698 
1699 
1700 
1701 
1702 
1703 
1704 
1705 
1706 
1707 
1708 
1709 
1710 
1711 
1712 
1713 
1714 
1715 
1716 
1717 
1718 
1719 
1720 
1721 
1722 
1723 
1724 
1725 
1726 
1727 
1728 
1729 
1730 
1731 
1732 
1733 
1734 
1735 
1736 
1737 
1738 
1739 
1740 
1741 
1742 
1743 
1744 
1745 
1746 
1747 
1748 
1749 


(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
CAATGTCTTG AGAGATCCCG GGATCCTTAA CAACTAGGAA AAGG 44 

(2) INFORMATION FOR SEQ ID NO: 40: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
GTAAGACACG ACTTATCGCC ACTG 24 

(2) INFORMATION FOR SEQ ID NO: 41: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: synthetic oligonucleotide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
GGAATTCGTC GACAGATCTC TGCAGCTCGA GGGATCCAAG CTT 43 


• * 
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1750 (2) INFORMATION FOR SEQ ID NO: 42: 
1751 

1752 (i) SEQUENCE CHARACTERISTICS: 

1753 

1754 (A) LENGTH: 32 base pairs 

1755 

1756 (B) TYPE: nucleic acid 

1757 

1758 (C) STRANDEDNESS : single 

1759 

1760 (D) TOPOLOGY: linear 

1761 

1762 (ii) MOLECULE TYPE: other nucleic acid 
1763 

1764 (A) DESCRIPTION: synthetic oligonucleotide 

1765 

1766 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
1767 

1768 GCTTGTTCGC CATGGATATC TTCTGTATGT TC 32 
1769 

1770 (2) INFORMATION FOR SEQ ID NO: 43: 

1771 

1772 

1773 (i) SEQUENCE CHARACTERISTICS: 

1774 

1775 (A) LENGTH: 143 base pairs 

1776 

1777 (B) TYPE: nucleic acid 

1778 

1779 (C) STRANDEDNESS: double 

1780 

1781 (D) TOPOLOGY: linear 

1782 

1783 (ii) MOLECULE TYPE: cDNA to mRNA 

1784 j 
1785 

1786 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
1787 

1788 GAT GCC AAA ANG CCT CAC ATG CCT CCT AGA GAA GCT CAT GTG CAA AAG 48 

1789 Asp Ala Lys Xaa Pro His MET Pro Pro Arg Glu Ala His Val Gin Lys 

1790 1 5 10 15 
1791 

1792 ACC CAT TCA ATK CCG CCT CAA AAG ATT GAG ATT TTC AAA TCC TTG GAG 96 

1793 Thr His Ser Xaa Pro Pro Gin Lys lie Glu lie Phe Lys Ser Leu Glu 

1794 20 25 30 
1795 

1796 GGT TGG GCT GAG GAG AAT GTC TTG GTG CAT CTT AAA CCT GTG GAG AA 143 

1797 Gly Trp Ala Glu Glu Asn Val Leu Val His Leu Lys Pro Val Glu 

1798 35 40 45 
1799 

1800 
1801 
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